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ABSTRACT 

We find that, even in the presence of discreteness noise, a Gaussianizing transform (producing a 
more-Gaussian one-point distribution) reduces nonlinearities in the power spectra of cosmological 
matter and galaxy density fields, in many cases drastically. Although Gaussianization does increase 
the effective shot noise, it also increases the power spectrum's fidelity to the linear power spectrum 
on scales where the shot noise is negligible. Gaussianizing also increases the Fisher information in the 
power spectrum in all cases and resolutions, although the gains are smaller in redshift space than in 
real space. We also find that the gain in cumulative Fisher information from Gaussianizing peaks at 
a particular grid resolution that depends on the sampling level. 

Subject headings: cosmology: observations — large-scale structure of universe — methods: statistical 



1. INTRODUCTION 

The power spectra of fluctuations in the matter and 
(more observably) galaxy fields carry important cosmo- 
logical information. On large scales and early epochs, 
where the fluctuations are small and Gaussian, this in- 
formation is preserved from early epochs, each Fourier 
mode having evolved linearly. On smaller scales, when 
the amplitudes of fluctuations grow to > 1, the lin- 
ear approximation breaks down. Modes of the over- 
density 5 become coupled, and their evolution becomes 
much harder to model. Inconveniently, we need high- 
order perturbation theory and numerical simulations to 
model the expectation value of the power spectrum ac- 
curately. A more fundamental problem is that the cos- 
mic (co)variance in the power spectrum acquires a dom- 



"translinear" scales (Mciksin & White 1999 


Scoccimarro 


et al. 1999 


Cooray & Hu 2001 ). This has unpleasant con- 



fled by a "transli near plateau" in cumulative Fisher infor- 
mation con t ent ([Rimes &: Hamilton|2005| [2006| |Neyrinck| 
eFaLl[2006l fNeyrinck k SzapudiP007[ |Lee & PenPUUgf 



Takahashi et al.||2009[ ). 



Recently, we found that performing a logarithmic 
transform on the matter overdensity 5, i.e. using ln(l + <5) 
instead of 6 as a density variable, drastically reduces the 
nonlinearities on trans linear scales in the power spectrum 
(Neyrinck et al. 2009 Paper I). The logarithmic trans- 



form pushes the translinear plateau to scales about 2-3 
times smaller, revealing about 10 times more Fisher in- 
formation. It also gives a power-spectrum shape intrigu- 
ingly close to the linear-theory prediction. The density 
field 1-point PDF (pro bability density func tion) is ap- 
proximately lognormal (Coles & Jones 1991); in fact, we 
found that an exact Gaussianization of the PDF (de- 
scribed below) performs even better. 

PDF Gau ssianization in lar ge-scale structure was first 
proposed by Weinberg (19921, although not explicitly to 
increase power-spectrum information content, but to rc- 
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construct the initial density field. |Croft et al. ( 1998 1 
used PDF Gaussianization in processing Lyman-a forest 
data from quasar spectra, but this turns out not to be 
an essential step in estimating small-scale power spectra 
from this data, because radiative transfer al ready maps 



the o verdensity into a narrow range of flux ( Croft et al. 
[19991 ). 

Although PDF Gaussianization impressively recovers 
the shape of the initial power spectrum, the transfer 
mation is less-successful in reconstr ucting initial mo de 



prep ). 



in 



by-m ode phases and amplitudes (Neyrinck et al 

This is because of bulk motions of matter on 
1 Mpc scales, and formation of the cosmic web. 
For example, the initial and final PDF-Gaussianized 
fields shown in Fig. 1 in Paper I look by eye quite differ- 
ent on small scales. Precise reconstruction of the phases 
and amplitudes of translinear Fourier modes appears to 
require the accurate estimation and subtraction of the 
Lagrangian displacement field (e.g., |Brenier et al .||2003 
~ 20071 



Eisenstein et al. 



Lavaux et al. 2008; Noneta 



ne 

al' 



2009). With a Lagrangian reconstruction, it is obvious 
that the shape of the linear power spectrum should be 
reconstructed on translinear scales, but the methods are 
much more complicated and computationally intensive 
than a simple density PDF transform. 

It does make sense intuitively that PDF Gaussian- 
ization should help the power spectrum to describe a 
field. While the cosmologically useful information in 
a Gaussian field is entirely in Fourier amplitudes, the 
information in a non-Gaussian field is partly in phase 
correlations, which are necessary to describe features 
such as sharp density peaks. Thus, flattening peaks re- 
stores information to the Fourier amplitudes from the 
phases. Phase correlations affect higher-order statistics, 
not the power spectrum, so Gaussianization can be seen 
as pulling information from higher-order statistics into 
the power spectrum. 

In the approximation that a non-Gaussian field is a 
non-linear transformation of a Gaussian field, PDF Gaus- 
sianization will produce a Gaussian field, vanquishing 
all higher-order correlations. Conversely, subjecting a 
Gaussian field to a non-li near transfor mation produces 
higher-order correlations (Szalay 1988). In particular, 



over length scales where the two-point correlation func- 



2 



tion is positive, a monotonic transformation will gener- 
ally produce a positive four-point function, which indi- 
cates a positive non-Gaussian contribution to the covari- 
ance through the trispectrum. Thus it is plausible that 
much of the covariance on small scales is purely from the 
non-Gaussianity of the PDF. Perhaps a related statistic 
to the power spectrum of t he Gaussianized field is the 
copula ( Scherrer et al.|[2010 ), which is similarly immune 
to monotonic transformations on the field it is applied 
to. 

Despite the promising results, there remain issues to 
be resolved before PDF Gaussianization can be used in 
practice. In this paper, we investigate discreteness noise. 
For a logarithmic transform, the problem becomes obvi- 
ous when there are cells with zero galaxies, which would 
transform to — oo. We first investigate the ideal case of 
Poisson noise in the matter power spectrum, and then 
the galaxy power spectrum. We also make a start at 
exploring the effect of rcdshift distortions. 



2. GAUSSIANIZING TRANSFORMATIONS 

There are many possible meanings of "Gaussianiz- 
ing." For example, Zhang et al. (2010) split a density 
field into Gaussianized and non-Gaussianized compo- 
nents based on distributions of wavelet coefficients, and 
showed that the Gaussianized component of the matter 
density field carries somewhat more Fisher information 
than the full field. In the present paper, by "Gaussian- 
ization" we mean PDF Gaussianization, i.e. a function 
applied equally to each pixel that reduces the higher- 
order moments of the one-point distribution of the field. 

We use a simple approach, first estimating the den- 
sity using simple Nearest-Grid-Point (NGP) mass assign- 
ment, and then Gaussianizing. Perhaps some gains in in- 
formation on small scales could come from using a higher- 
order mass assignment scheme, or an interpolation natu- 
rally suited to discrete data, for exa mple the DTFE (De- 
launay Tessel lation Field Estimator, |van de Wcyg aert fc| 
|Schaap|[2009[ ). Sophisticated techniques have even been 
developed to estimate the ln(l+<5) field directly (e.g 



Ki- 



|taura et al.||2010[ [Weig fc Enfilin||20To| . Here we choose 
JNGP for its simplicity, and for the simple, constant form 
of its shot noise, at least for ideal Poisson data. More- 
sophisticated techniques could perform (or inform) even 
better. 

The two transformations we consider are "exact" 
Gaussianizati on, G(5), and a m odified logarithmic trans- 
form, log + (<5). Seo et al. (2010) have dealt with the prob- 
lem of log-transtormmg zero cells by introducing a den- 
sity floor, i.e. adding an arbitrary small, positive param- 
eter to the argument of the logarithm. This alternative 
modified logarithmic transform did succeed in boosting 
the Fisher information in the lensing-convergence power 
spectrum. There are many possible alternative Gaussian- 
izing transforms. One commonly used transform used for 
producing a Gaussian d i stribu tion from a Poisson distri- 



bution is the Anscombe ( 1948[ ) transform, but this trans- 
form is not ideal in our case. While the separate density 
PDF's for each pixel, over different realizations, should 
be Poisson in our case, the global pixel density PDF will 
generally not be Poisson. 

Our first transformation, G(S), is the density one 
would expect from an exactly Gaussian PDF with the 



same ranking of cell densities as 8. Explicitly, 
G(S) = v^crerr 1 (2/ <5 -1 + 1/N), 



(1) 



where / <( 5 is the fraction of cells less-dense than S in the 
density field, a is the standard deviation of the Gaussian 
that S is mapped onto, and N is the number of cells. If 
there are multiple cells with the same S, as usually occurs 
in Poisson-sampled <5's, then there will be some range of 
G(5) that is mapped to cells with the same 8. In this 
case, the actual G(6) that we assign to these cells is an 
average of G(S) over this range. 

On the other hand, a drawback of G(S) is that it is 
globally defined, nontrivially depending on the entire S 
field. Also, the implicitly defined G _1 function need 
not be well-behaved, complicating attempts at predict- 
ing statistics of G(8) analytically. So, we also investigate 
a modified logarithmic transform, which only depends 
globally on S through the mean density. We define 



5 > 
otherwise 



(2) 



3. POISSON-SAMPLED MATTER DENSITY FIELDS 

First we investigate the simple case of exact Poisson 
discreteness noise, in the matter field investigated in Pa- 
per I. We Poisson-sam ple the density field of the Millen- 
nium Simulation (MS, Springel et al.|2005 ) on a publicly 
available 256 3 density grid at z = 0, at various sam- 
pling levels, from n cc n = 1/64 to n CG n = 64 particles per 
(2 k- 1 Mpc) 3 cell. (The full sampling level of the MS is 
"■ceil ~ 600.) To Poisson-sample, we simply set the num- 
ber of particles in a cell to a random Poisson number of 
mean equal to the full-sampling density. 

3.1. Effects on the mean 

It is well-known (e.g., Peebles 1980) that particle dis- 
creteness produces a white- noise 1/n shot noise in the 
power spectrum, where n is the number density of parti- 
cles. 

Fig. IT] shows shot noises of S and G(S), estimated as 
the difference between power spectra of the density fields 
with and without the added discreteness noise, measured 
from the MS matter density field on a 256 3 grid. When 
Pg and Pi og + (the power spectra of G(S) and log + (<5)) 
are plotted in this paper, we multiply them by constants 
to line them up with P$ in the lowest-fc bin. For Pg, this 
is equivalent to setting the a used in Eq. |TJ). 

For P$, as expected, the simple 1/n estimate works 
quite well. The shot noise in Pg, on the other hand, car- 
ries some slight scale dependence, and is generally greater 
than the Pg shot noise. Intuitively, Gaussianization in- 
creases the shot noise because it increases the contrast 
between low-density cells. 

The green dot-dashed curve in Fig.[TJshows an estimate 
of this shot noise. It was calculated from a histogram of 
G(5), using the empirical expression 



l/7leff = Vcell f(6i){6 l+ i - Si), 



/ J 



(3) 



substituting G(S)i for cV Here, V ce u is the volume of a 
cell, and f(Si) is the fraction of cells with S — Si, for 
density bins i. 
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Fig. 1. — Poisson shot noise in the power spectra of 5 (black) 
and the Gaussianized G(S) (green), using the MS matter density 
field. Dashed and dotted curves show Pg and Pq at n ce n = 1/8 
and full sampling, respectively; solid black and green curves show 
their differences. At low k, the absolute values of these differences 
are shown with light dotted lines. The solid red curve is the initial- 
conditions (linear) power spectrum, multiplied by a factor to line 
up with Ps in the lowest-fc bin. The dash-dotted line is the shot- 
noise estimate in Eq. (J3J> . (Some curves described here do not 
appear in the figure legend.) 

Eq. ([3]) is motivated by the low-density tail of the den- 
sity distribution, where the G function stretches the con- 
trast. For example, in a density field with cells of only 
or 1 particle, l/n cff = V ce u(h - S ) = VAn(l /Articles - 
0/iVparticies) = l/n. When this density field is trans- 
formed by G, the shot noise increases, proportionally 
with [G(6)i — G(S)o]/(Si — 5q). A simpler approxima- 
tion than Eq. ([3| would be to use only i = 0,1 in the 
sum (as in the preceding example), but we found that 
Eq. pi) works a bit better. 

Fig. [2] explores the shot noise in P G and P\ ag + with 
varying sampling levels on a 256 3 grid. The shot noise in 
Pog+ at high sampling and high k is generally smaller 
than in Pq; however, the shape of Pi og+ 's shot-noise 
curve is less consistent than Pg's. This hints at the 
higher (co)variance in P g+ than in G, which will be 
discussed further in the next subsection. The estimate 
in Eq. Q works well for low sampling, but overestimates 
the shot noise if the sampling is n cc u > 1, especially in 
the Pq case. 

Fig. [3] shows how the shot noise varies with resolution, 
at a fixed sampling. Generally, especially for Pq , the shot 
noise is rather consistent for different resolutions. In fact, 
the approximation that the shot noise is constant over 
different resolutions seems to be a better approximation 
than the one in Eq. (§, so we will use it when we deal 
with galaxies (with no easily measurable "no-shot-noise" 
power spectrum). 

The bottom panel of Fig. [3] shows the "nonlinear trans- 
fer function" P(fc)/P; n i t (fc) for Pq, raw and after sub- 
tracting shot noise, and for Pg (after subtracting shot 
noise). 

3.2. Effects on Information Content 



Fig. 2. — Shot noise in the power spectra of G(S), and log + (<5), 
with varying n ce n, the mean number of particles per cell on the 
256 3 grid. From bottom (magenta) to top (blue), the number of 
particles per (2-/i _1 Mpc) 3 cell varies from 64 to 1/64, in multiples 
of 8. Solid curves are power spectra of the transformed Poisson- 
sampled fields, and the black dotted curves are power spectra of 
G(8) and log_|_(<5) with the full MS particle sampling. Solid curves 
are multiplied by factors to line up with the dotted curves in the 
lowest-A; bin. Dashed curves show the differences between the solid 
and dotted curves, and the dot-dashed lines show the shot noise 
estimated from Eq. 

formation in the power spectrum. The cumulative Fisher 
information in the power spectrum about parameters a 
and (3 over a range of power-spectrum bin indices i € 1Z 
is estimated as 



F a0 (1Z) = 



dhxP- 
da 



ainP; 



d/3 



(4) 



where C-r is the square submatrix of C with both indices 
ranging over 1Z. is the covariance matrix of the power 
spectrum in bins, C tJ = (AP- sn APj sn ) /(Pr sn P; sn ) = 

(AlnPA n AlnPA n ). 

In Paper I, we considered the signal-to-noise ratio S/N, 
the information in the power spectrum about the power 
spectrum itself. (S/N) 2 (called simply S/N in Paper I) 
is the Fisher information about a (possibly hypothetical) 
parameter that depends on each mode of the power spec- 
trum equally. Thus, the derivative terms above were set 
to unity. For Ps , the linear-power-spectrum amplitude A 
'e.g. , inves tigated in Rimes & Hamilton 2005 i Neyrinck 



As in Paper I, we use a Fisher information ( Fisher 
1935 Tegmark et al.|1997 ) formalism to quantify the in- 



et al.|2006 ) is a parameter such that c'ln P i " sn /^ln A = 1 
on linear scales, reaching « 2 on translinear scales. The 
situation is more subtle in the case of power spectra of 
nonlincarly-transformed fields, since there is generally a 
large-scale bias (Paper I). But this does not affect pa- 
rameters that depend on the power spectrum's shape. 
And for parameters that depend on the amplitude, the 
large-scale bias can be constrained by measuring both 
Ps and Pq (or Pi og +) in the linear regime. In this pa- 
per, though, we simply investigate the S/N in Pq (and 
Piog+) themselves. This would be entirely appropriate 
when comparing data to a mock catalog, for example. 

Shot noise further complicates the situation. The sta- 
tistically stable shot noise component Si of the power 
spectrum that appears on small scales actually reduces 
the covariance in P[ sn +Si (the power spectrum including 
shot noise), mimicking a gain in clustering information, 
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Fig. 3. — Top. Shot noise in the power spectra of G(<5), and 
log + (<5), with varying grid resolution, for a matter density field 
with a fixed Poisson sampling of n ce n = 1/8 particles per (2- 
h' 1 Mpc) 3 cell. The power spectra are shown at grid resolutions 
of 32 3 (black), 64 3 (yellow), 128 3 (blue), and 256 3 (red). Both the 
power spectra of the Poisson-sampled field and their differences 
from the full-resolution power spectra are shown as solid curves; 
the full-resolution power spectra appear as dashed curves. The 
shot noise is rather consistent at different resolutions, especially in 
the G(8) case. 

Bottom. Ratios of Pg and Pq to the initial power spectrum, nor- 
malized to 1 in the lowest-fc bin. Pq and Pg are at full sampling 
(as in Paper I) , and "Pg , + shot noise" is the raw power spectrum 
of the density field sampled at n co n = 1 on the 256 grid. Ratios 
at two resolutions are shown: 128 3 and 256 3 . 

when really all that's being accurately measured is the 
shot noise. The correct thing to investigate is the infor- 
mation in (P{ sn + Si) about the power spectrum without 
shot noise, P{ an . Thus we investigate 

rs/NV- V QHPj* + (c -x. dHP r + Sj ) 

\P/M) - 2^ flln P.-"" \ TZ Hi r o ln p-sn 



p-sn 
i 

P,- sn + S, 



Pj sn + Si 



(5) 



For the last line, we assume that the shot noise is inde- 
pendent of the clustering fluctuations. In fact, this result 
is roughly what one would get by subtracting the mean 
shot noise from the power spectra before measuring the 
covariance matrix. For example, if the covariance ma- 
trix is diagonal, the Fisher-matrix entries for the power 
spectrum including shot noise will be (C^ 1 )^ = C^ 1 = 
(P[ sn + S , i ) 2 /(AP l " sn ) 2 - Plugging this into Eq. (|) causes 
the fractions to cancel, giving the Fisher-matrix entries 
for thepower spectrum without shot noise. 

Fig. [i] shows a comparison of the (S/N) 2 in P§, Pq, 
and -Pio g +, for Poisson-sampled density fields of various 
resolutions and samplings of the MS. As in Paper I, co- 
variance matrices are estimated from power sp ectra mea- 
sured after applying 248 sinusoidal weightings ( Hamilton 
et al.|2006 ) to the density field. Pq (dashed) out-informs 
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Fig. 4. — Information (S/N) 2 curves in the presence of dis- 
creteness effects, for Pg (solid curves), and Pq, and Pi og + (dashed 
curves), using Eq. AS}. The plethora of curves show the re- 
sults for different combinations of samplings and resolutions. The 
three samplings shown are n ce il = 1/64, 1, and 8 particles per 
(2/i _1 Mpc) 3 cell, in blue, green and red, respectively. These 256 
density grids are degraded in resolution by powers of two, giving 
32 3 , 64 3 , and 128 3 grids. Information curves are measured for 
each case, and symbols are placed at their ends (at the Nyquist 
frequency). All of these differences have little effect on the Pg 
information, but change Pq, and Pi og + significantly. 

P$ (solid) in all cases, although the gains are modest at 
a sampling of n ce ii = 1/64. 

Interestingly, especially for Pq, there appears to be a 
resolution at which the gains in information from Gaus- 
sianization (i.e. the vertical distance between the dashed 
and solid curves) peak. This is not surprising: in the low- 
resolution limit, the field is already Gaussian, so Gaus- 
sianization has no effect. In the high-resolution limit, 
even the highest peaks can only be sampled with one par- 
ticle, giving a field of only O's and l's, which Gaussianiza- 
tion will only cause to be multiplied by a constant. Math- 
ematically, the information without the Si/(P i ~ sn + Si) 
fractions keeps rising, but these fractions can cause it 
to turn over, producing a peak. As we find below in 
Section |4.2| the peak is generally at a resoution a few 
times coarser than that where -P _sn (fcNyq) ~ S'(fcNyq)- 
And particularly if one is interested in the power spec- 
trum over the range of scales just smaller than the linear 
regime (0.1 < fc/(/iMpc _1 ) < 0.3), and not in scraping 
information from smaller scales, it is wise to Gaussianize 
at this peak resolution or coarser. 

4. GALAXY DENSITY FIELDS 

In this section, we address the impacts of discreteness 
and redshift-space distortions on the observationally rel- 
evant case of a galaxy density field, in both real and 
redshift space. Again we use the MS, usi ng the pub- 
licly available galaxy catalog as modelled by |De Lucia fc| 



Blaizot (2007). We use three galaxy samples, with R- 
band absolute-magnitude cuts R < —17, R < —20, and 
R < —22; these have respective mean galaxy number 
densities of about n = 0.003, 0.02, and 0.1 (/i^ 1 Mpc)" 3 . 
We generate galaxy-density grids using NGP density as- 
signment from these galaxy samples. A tiny fraction of 
galaxies exactly overlapped other galaxies in position; in 
this case, we keep only the brightest one. In the redshift- 
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Fig. 5. — In bold, real- (black) and redshift-space (red) power 
spectra of Gaussianized galaxy-density fields, using MS galaxy 
samples satisfying R < —17 (solid), R. < — 20 (dashed), and 
R < —22 (dotted). Shot noise, estimated to be P<3(fc ma x) (see 
text), has been subtracted from them; this constant shot noise for 
each case is shown in faint lines. 

space case, before gridding the galaxies we first displace 
them along the x-axis by v x /H . 

4.1. Effects on the mean 

Fig. [5] shows Pq for galaxy density fields using the 
R < —17, R < —20, and R < —22 samples, in real and 
redshift space. In each case, we have subtracted a shot 
noise, constant in k, and shown with faint lines. In this 
subsection we do not show Pi og + because its shape in all 
cases is nearly identical to Pq. As in the matter power 
spectrum section, in showing G(S), we set the variance 
of the Gaussians onto which the <5's are mapped so that 
the power spectra line up in the lowest k bin. 

In the matter case, the shot noise could be directly 
measured by comparing to a very well-sampled matter 
density field, which we lack in the galaxy case. We es- 
timate the shot noise using a prescription motivated by 
the rough agreement of the shot noise across different 
resolutions in Fig. [2j We measure the power spectrum 
on a rather high-resolution (512 3 ) grid, and assume that 
for all resolutions, the shot noise is a constant with k, 
at an amplitude of PG(k ma _ x ). (Here, fc max is the high- 
est k measured in the high-resolution grid, pin the cu- 
bic box, fc max = \/3fcNyquistj although we only plot the 
power spectra to fcNyquist-) This probably overestimates 
the shot noise somewhat, but perhaps this is appropri- 
ate, at least for Pq, given the slight rise in the shot-noise 
power at slightly lower k than k max (e.g., at k « 0.8 
/iMpc" 1 in Fig.[2|). 

Fig.[6]shows ratios of galaxy power spectra to the linear 
power spectrum P; n it, in both real and redshift space. 
These could be thought of as transfer functions between 
Pinit and z = galaxy power spectra. 

In real space, in the limit of low sampling (in the R < 
—22 sample), Pq and Pg look nearly identical before shot 
noise is subtracted. This is not surprising, as G{5) differs 
not much, after removing a linear scaling, from 8 in this 
limit. But after shot noise is subtracted, even at this 
sampling, Pq seems to track Pi n n a bit better than Pg. 
As the sampling increases (in the R < — 17 sample), Pq 
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Fig. 6. — Nonlinear transfer functions Pc{k) / '-Piincar an d 
Pi(fc)/Pli ncar (fc) for three MS galaxy samples, in real and redshift 
space. The power spectra are measured on 128 3 (green) and 256 3 
(black) grids. PcW/Pimcar is shown, both including (dotted), and 
having subtracted (solid), a shot-noise estimate (see text). Thin 
and bold dashed lines show the same for P$(k) / Pii nca r(fc). 

comes to track P; n i t significantly better, even before shot 
noise is subtracted. 

In redshift space, the story is not as clear. A full anal- 
ysis of the effects of redshift-space distortions on Pq is 
beyond the scope of this paper. One obvious piece of 
analysis that is lacking is that here we merely analyze 
the angle-averaged redshift-space power spectrum. But 
in general, Gaussianization modifies the shape of galaxy 
power spectra less in redshift space than in real space. 
This is likely because the galaxy-density PDF is already 
somewhat Gaussianized because peaks are smeared by 
fingers of God. 

4.2. Effects on Information Content 

Our method in investigating the galaxy power spec- 
trum Fisher information, (S/N) 2 , is essentially the same 
as in the matter case. The difference is that we do not 
have a meaningful measurement of the exact shot noise, 
and so we estimate it as in Section [4. II This estimate is 
conservative (i.e. likely an overestimate) from the point of 
view of information estimation, so the information curves 
for Pq and P\og+ appearing below could be considered 
conservative at high k (i.e. perhaps a slight underesti- 
mate). For Pg, we use the simple 1/n factor. 

Fig. [7] shows (S/N) 2 (fc Nyq ) curves for the real-space 
power spectra Pg, Pq, and P\ g+, for the three MS 
galaxy samples, measured on grid sizes varying from 
16 3 to 256 3 . To reduce clutter, we do not show each 
cumulative (S/N) 2 (k) curve, but just the total cumu- 
lative (S/N) (fcNyq) up to the Nyquist frequency fcN yq . 
In the matter case (Fig. [4j), these appear as symbols 
at information-curve endpomts. The dashed lines show 
(S/N) 2 without taking into account shot noise [i.e. with- 
out the (Pr sn + Si) /Si fractions in Eq. @]. The solid 
lines, for which these fractions are included, are more 
meaningful. 
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Fig. 7. — Total cumulative signal-to-noise (information), up 
to /sNyqi for the galaxy real-space Pq, Pi og _|_, and Pg, computed 
at different resolutions and galaxy samples. Each point comprising 
the curves is analogous to a (S/N) 2 curve endpoint-symbol in Fig.Wl 
The dashed curves show (S/N) 2 in the raw power spectra, while for 
the solid curves, the shot-noise effect has been taken into account, 
as in Eqn. |5j. The vertical dotted lines are at the resolutions 
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power spectrum are closest. 



the shot-noise and clustering components of the 
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Fig. 8. — The redshift-space version of Fig. [7] 

Typically, as suggested in the matter case in Fig. |4j 
there appears to be a peak in the gain in (S/N) 2 (fc Nyq 
provided by the Gaussianization transform. The dotted 
lines are at the resolution where the shot noise is most 
comparable to the clustering signal; here, there is typi- 
cally a trough in the cumulative (S/N) 2 . It appears that 
the resolution that optimizes the gain from Gaussianiz- 
ing is typically a factor of 2-4 coarser than that. But in 
all cases, the Gaussianized power spectra out-inform the 
standard power spectra. 

Fig.[8]shows the same for redshift-space power spectra. 
Here, Pg fares better, likely because in redshift space, 
fingers of God smear out density peaks and already make 
the PDF of 8 more Gaussian. Still, again the power 
spectra of the Gaussianized fields out-inform Pg in all 
cases. 

5. CONCLUSION 

In this paper, we extend our previous analysis of the 
rejuvenating effects that PDF Gaussianization has on the 
matter power spectrum (Paper I). We include discrete- 
ness effects, and look at the observationally relevant case 



of the galaxy density field, both in real and redshift space. 
As in Paper I, we analyze the Millennium Simulation. 

We find that the conclusions of Paper I remain un- 
changed in the presence of discreteness noise, as long as 
one is looking at scales where the shot noise (which Gaus- 
sianization does increase somewhat) is negligible. In real 
space, Gaussianizing the galaxy and discretized matter 
density fields does seem to extend the range over which 
their power spectra trace the linear power spectrum, well 
into the nonlinear regime, until the shot noise becomes 
comparable to the clustering signal. In redshift space, 
Gaussianization also reduces the small-scale rise in the 
galaxy power spectrum. 

Gaussianization removes or reduces the small-scale rise 
that one sees in power spectra relative to the line ar power 
spectrum . In the context of the halo model (e.g., |Cooray| 
& Sheth 2002), this rise is associated with a one-halo 
term. It is perhaps not surprising that Gaussianizing 
would reduce this one-halo term, since haloes are the 
most non-linear, non-Gaussian structures in the Uni- 
verse. However, the removal of this rise in the galaxy 
power spectrum without direct mention of haloes also 
suggests that explaining galaxy bias w ith a non-linear 



transformation of a Gaussian field (e.g., Politzer & Wise 
1984 Szalay 1988 ), which has somewhat gone out of fash 



ion, may be a fruitful area for further study. 

In all cases, Gaussianization also improves the inher- 
ent Fisher information, (S/N) 2 , of the power spectrum. 
But the degree of help it provides depends on the resolu- 
tion of the grid over which the PDF is Gaussianized. It 
seems that the grid size providing the most cumulative 
added (S/N) 2 is a factor of 2-4 coarser than the reso- 
lution where, at the grid's Nyquist frequency, the shot 
noise and clustering components of the power spectrum 
are of comparable magnitude. That is, to reap the in- 
formation gains of Gaussianization on translinear scales, 
one should be careful not to use grid cells that are too 
small. In redshift space, the gains in (S/N) 2 for galaxy 
density fields are somewhat smaller than in real space, if 
the galaxy sampling is high enough to resolve fingers of 
God. This is because fingers of God smear high density 
peaks, producing an already more-Gaussian PDF. 

While discreteness effects are an essential issue to in- 
vestigate in the study of power spectra of Gaussianized 
fields, a few issues still remain. With a variable sur- 
vey selection function, it may be necessary to apply a 
Gaussianization transform seperately in different redshift 
shells. We have made a start at analyzing the effects of 
redshift distortions, but much more work can be done 
in this area. It also remains to be investigated precisely 
how faithfully, and to what scales, the power spectrum of 
the Gaussianized matter and galaxy density fields traces 
the linear power spectrum, for arbitrary cosmologies. 
Put more practically, the Fisher-matrix analysis needs 
to be extended to particular cosmological parameters. 
Also, our assertion that Gaussianization pulls informa- 
tion from higher-point statistics could do with further 
quantitative elucidation. 
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